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ABSTRACT 

The Air Force Logistics Command (AFLC) is revising 
and enhancing its data-processing capabiliti6.s with the development 
of a large-scale, multi-site, on-line, integrated data base 
information system known as the Advanced Logistics System (ALS) . A 
data integrity program is to be built around a Data Base Hanager 
(D6H) , an individual or a group of people who provides the focal 
point for the coordination and control of data base functions. 
Data-related computing system activities serve as the basis for 
developing a statement of responsibilities for a DBM. The DBH can use 
education and indoctrination to create awareness of the role that 
each member of the organization plays in assuring data integrity. 
(Author/WH) 
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PREFACE 



The Air Force Logistics Command (AFLC) currently has under de- 
velopment a large-scale I integrated data-base information system 
known as the Advanced Logistics System (ALS). At the request of AFLC, 
members of Rand*s Information Sciences Department have been supporting 
the ALS implementation in specified technical areas. One of these 
areas has been the development of a data integrity program. 

This report serves as an introduction to the problems and require- 
ments of data integrity in an ALS-type system. It contains guidelines 
for the establishment of a data integrity program built around a new 
organizational element, the Data-Base Manager. It is assumed that 
readers of the report have a basic familiarity with data-processing 
concepts and terminology. 
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SUMMARY 

The Air Force Logistics Command (AFLC) is revising and enhancing 
its data-processing capabilities with the development of a large-scale', 
multi-site, on-line, integrated data-base information system known as 
the Advanced Logistics System (ALS). This report serves as an introduc- 
tion to the problems and requirements of data integrity in an ALS-type 
system. More specifically, it contains guidelines for the establishment 
of a data integrity program built around a Data Base Manager (DBM). 

As ALS-type information systems become larger and more complex, 
they are increasingly sensitive to the ways in which their data bases 
are created, maintained, used, and protected. Inasmuch as these data- 
base functions involve people throughout the entire organization, there 
must be a degree of centralized coordination and control over these 
functions. In this report, that coordination and control—and the 
associated organization, charter, and procedures— are collectively de- 
fined as a data integrity program* 

The key element in a data integrity program is a new organizational 
element, often referred to as the Data Base Manager. The DBM is an in- 
dividual or a group of people that provides the focal point for the coord- 
ination, and control of the data-base functions. This concept is a rela- 
tively recent development in the computing field and— although general 
agreement exists on the need for a DBM— there are many approaches to 
the questions of its responsibilities, organization, and staffing. 

Section II discusses data-related computing system activities as 
the basis for developing a statement of responsibilities for a DBM. 
Two questions are addressed: Which activities should be within the 
purview of the DBM? What should be the degree of the responsibility 
in each case? 

Section III presents alternative approaches to creating a DBM. 
One issue is that of placing the DBM within an organization. The 
nature of the DBM's role requires (1) that the lines of communication 
and authority be sharply drawn, regardless of the position in the organ- 
ization; (2) that there be firm, consistent management support. The 
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DBM will normally be established as the system becoiucs operational. 
However, many of the DBM's activities begin during the system design 
and development phases. With prior planning, much of the work done 
during these phases will be useful to the DBM. In addition, there Is 
a natural transition of people from several of the specialized develop- 
ment groups (e.g., those responsible for data conversion) Into the DBM 
staff. 

Although differences of opinion exist as to how the DBM work should 
be organized and whether staff members should be specialists or gen- 
erallsts, there Is agreement on the personal characteristics required 
for success. These Include substantial data-processing experience, 
excellent cotnmunicatlons skills, and a working familiarity with user 
operations* 

As an essential element in an ALS-type system, the DBM will con- 
tinue to evolve In terras of responsibilities, organization, and staffing. 

A data Integrity program must provide for establishing and main- 
taining an awareness of the role that each member of the organization 
plays In assuring data Integrity. Section IV discusses how this condi- 
tion can be attained through a program of education and indoctrination 
that begins during system development and continues throughout the life 
of the system. The DBM can play an Important role in creating and main- 
taining this awareness. 
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I. INTRODUCTION 

THE NEED FOR A DATA INTEGRITY PROGRAM 

Developers 9 managers , and users of computer-based information sys- 
tems have spent a great amount of time and energy improving the technical 
performance of their systems. Although the track record of technical 
achievement has not been perfect » there have been steady gains. 

In contrast 9 the organizational and procedural aspects of computer* 
based information systems have received relatively little attention. 
As a result, many of the most serious problems are not technical, but 
rather have to do with organizations and procedures [1]. So much con- 
cern has been put into making the system work from a technical stand- 
point that these other aspects have not received their proper share of 
attention. Systems have failed to perform countless times because: 

1. There has been a person-to-person communication failure. 
For example, the user informally requests modifications to 
the logic of programs that process his data. As a result 
of the lack of a standard method of communication, the pro- 
grammer misunderstands the request and codes incorrect logic 
into a program. 

2. Inadequate or incorrect procedures have been used. Data 
files have been destroyed, or at least degraded, innumerable 
times because backup procedures have either been non- 
existent or have been inadequate to cover the particular 
situation. Another problem has been the absence of valida- 
tion procedures and criteria, resulting in incorrect computer 
output being generated and released. Tebles required by com- 
puter programs are improperly; updated or incorrect versions 
are used because procedures are not adequate. 

3. There has been a failure to "human engineer" some part of 
the system. An example is the poor form design that leads 
to an unacceptably high input error rate in the data entry 
process [2]. 
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4» Responsibilities for error resolution have not been clearly 
assigned* 

The above list of typical system problems growing out of pro- 
cedural and organizational shortcomings Is merely representative* 

This report will present an approach that addresses many of the 
organizational and procedural problems related to data Integrity* 

SIMPLE VERSUS COMPLEX INFORMATION SYSTEMS 

Computer-based Information systems come In all sizes and shapes, 
and can be classified In many different ways, depending on the character- 
istics that are chosen to describe them* Three data-related character- 
istics that are useful in classifying a system in a discussion of data 
integrity are (1) the update mode (batch vs* on-line), (2) the access 
scheme (sequential vs. random), and KS) the degree of functional inte- 
gration (if any) in the data base* 

Considering these characteristics, we find on one hand the simple 
batch-oriented system using sequential files and serving a single func- 
tional user. Typical examples are Air Force Logistics Command (AFLC) 
data systems that are largely built around magnetic-tape files* At the 
other extreme is the complex on-line system organized around functionally 
integrated data bases — e.g., the Advance Logistics System (ALS), under 
development by AFLC, with its on-line updating of massive random-access 
data bases* (See i^pendlx*) In between these extremes are systems witn 
varying degrees of complexity* (See Table 1*) 

Experience gained with complex systems, particularly the on-line, 
integrated data-base variety, has demonstrated that many aspects of 
the organizations and procedures that were developed to administer and 
operate simple systems are inappropriate. There is a growing awareness 
of the sensitivity of an entire system to the way in which the data 
base is created, maintained, used, and protected [5]. It becomes 
apparent that a carefully developed effort is required to coordinate 

5f 

Functional integration refers to the combining of data from two 
or more functional areas, such as personnel records and accounting, 
into a single data base [3,4]* 
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these functions across the entire organization. This effort and the 
resulting organization, charter, and procedures may be considered a 
data integi*ity program. 

The potential costs of a failure to maintain the integrity of the 
central data bank are great In terms of time, dollars, and degraded 
performance; and the Issue of data Integrity deserves both early and 
continuing attention. 

Anyone who has lived through the trials and tribulations of de- 
signing. Implementing, and operating an ALS-type system Is well aware 
of the magnitude and complexity of the task of assuring data- Integrity 
In the data base. In addition to presenting guidelines for the estab- 
lishment of a data Integrity program built around a Data Base Manager, 
this report serves as an Introduction to some of the problems and re- 
quirements of data Integrity In an ALS-type system for those who have 
not had such an experience. 

Section II contrasts the complex, ALS-type system with simpler 
systems In terras of key data-base related activities and associated 
Implications. Section III considers Issues associated with the crea- 
tion of a Data Base Manager. Section IV discusses creating a system- 
wide awareness of accountability for data integrity. 
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II. DATA-RELATED ACTIVITIES IN SIHPLE AND 
COMPLEX INFORMATION SYSTEMS 

This section contrasts simple and complex systems, as defined 
above, to Illustrate the need for coordination and control of t'/ie var- 
ious activities. Obviously, however, the comments apply to all computer- 
based systems, and the need becomes greater as the complexity of the 
system Increases [6-9]. 

Following Is an examination of a number of key system activities 
In terms of the requirements they Impose on the design. Implementation, 
and operation of an ALS-type system. 

DATA-ELiMENT DEFINITION 

Data-element definition Is the process of Identifying the attri- 
butes of an Item of Information (data element), such as "part number" 
or "salary." Some of these attributes are data^element size, recording 
mode, format, and relationships to other data elements. 

In simple systems, data«^lement definition Is usually the responsi- 
bility of a programDer/analyst working in conjunction with the user. 
Both the initial definition and subsequent changes are coordinated and 
implemented by these parties. 

Data-element definition in an ALS-type system can be, and usually 
is, a totally different activity. A data element and its attributes 
may be of interest to a number of users from different functional areas. 
These users may have conflicting requirements, resolvable either through 
comprondse or the establishment of additional data elements. For ex- 
ample, the individual files of payroll and personnel records depart- 
ments may each contain a data element known as "employee address." The 
address in the payroll file may identify the location to which all pay- 
roll-related correspondence should be sent; the personnel file, the 
employee's legal address. If information from these files is inte- 
grated, this difference will have to be resolved as described above. 

The nature of an ALS-type system requires that changes in data- 
element definitions be made quickly and with minimum impact on users. 

ERLC 
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Thls condition affects the approach to incorporating data-element defi- 
nitions into user programs. In simple systems, the definitions are 
usually written directly into each program; whereas in the ALS-type 
system, the definitions are often maintained in a system-control table 
and are available for use as required throughout the system. A central 
coordination and control point for data-element definition is essential. 
The nature and degree of control may vary, but the function must exist. 

DATA-BASE STRUCTURE DEFINITION 

Data-base structure definition is the process of selecting a 
method or methods for organizing data elements that satisfies both 
the user's requirements for logical relationships among the data 
elements and the computing system's requirements regarding physical 
organization. 

Sequential processing and a single primary user make structure 
definition relatively straightforward in roost simple systems. In an 
ALS-type system, however, new issues make this activity much more diffi- 
cult. The structure is affected by questions of storage-device char- 
acteristics, access methods, data-management system capabilities, and 
•response-time requirements. Again, coordination is required to insure 
effective structure definition— both initially and as the system changes. 

DATA-BASE CREATION AND INITIAL CERTIFICATION 

Data-base creation and certification involve the construction, to 
design specifications, of a data base, usually from some combination of 
automated and manual sources. This task is often a difficult and peri- 
lous part of implementation; Some particularly critical tasks in an 
ALS-type system are: 

1. Coordinating the process of selecting from among multiple 
sources of data when preparing .to create an integrated data 
base. 

2. Analysis of the source data to determine their suitability, 
a complex task involving not only the validation of specific 
data values but also questions of time phasing, intra-record 
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consistency among data elements, and validation of the source 
selectiotis. 

3. Establishing criteria for the analysis and certification, a 
difficult task that often brings to the surface deficiencies 
in the system design. It requires substantial attention and 
coordination during both the development phase and system 
operation. 

4. Planning and coordination of any clean-up required in prepar- 
ation for the data-base creation. This task, requiring par- 
ticipation of system designers, developers, and users, is 
unforti aate^y often of low-visibility. 

DATA-BASE UPDATE 

Updating is the process of applying current information, in the 
form of transactions 9 to a data base, causing the creation of new 
records or the modification/deletion- of existing records. In an ALS- 
type system, updating is concerned with such questions as: 

1. Which program(s) is authorized to update each data element? 

2. For a given data element, is the same program authorized to 
perform all of the update functions (add, delete, modify)? 

3. How is the authority to update controlled, with respect to 
both operational control and changes in authority? 

4. Under what conditions can updating take place? For instance, 
is updating allowed during only certain times of the day? 

5. How are program changes coordinated with associated changes 
in procedures, personnel assignments, documentation, etc.? 

6. How is background processing of the data base coordinated 
with on-line updating to insure proper time-phasing? 

Implicit In these quest:ions is the need for a degree of control 
and coordination* 

TRAt^SACTION EDITING 

Transaction editing may be broadly defined as the process of ex- 
amining input transactions and accepting or rejecting them, based on 

O 
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prevlously established criteria. Editing is often done through a 
combination of manual and automated procedures. 

This process is normally straightforward in simple systems, involv- 
ing one edit program, the batching of transactions generated from a 
single source, and a relatively long error detection/correction/re- 
submission cycle. 

In an ALS-type system, on-line input from many sources (and the 
short errot-processing cycle inherent in this type of operation) re- 
quires the careful development of input procedures and the thorough 
training of terminal operators — as well as very complete edit logic in 
the transaction-processing programs. To the extent that the data base 
serves multiple users, the definitions of edit criteria must be co- 
ordinated. In addition, continuous monitoring of the editing process 
will help to identify problems In input procedures, edit logic, and/or 
system operation before they can seriously degrade the data base. 

DATA-BASE QUALITY ANALYSIS 

Quality analysis involves continuing and/or periodic analysis of 
the data base to insure that it satisfies the requirements of the system 
specifications with respect to both structure and data values. Because 
the potential for problems is great in an ALS-type system, a coordinated 
approach is required. Some key considerations are: 

1. What is the value of an error-free data base? Can some 
level of data-base degradation be tolerated, and how should 
it be measured? Degradation can be measured in terms of both 
incorrect data values or flaws in the data-base structure. 

2. What kinri of auditing techniques should be used in the anal- 
ysis process [10]? 

3. ^Vhat are organizational responsibilities for: 
a. Specifying evaluation criteria?* 

Probably the same or similar to criteria used both in the analysis 
of source data in preparation for data-base creation and in transaction 
editing. 
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b. Selec«:lng tools and techniques and developing procedures 
to be used In the data-base quality analysis? Specify- 
ing ^ designing) and developing software required to sup- 
port the analysis? 

c. Training and supervising those responsible for the analysis? 

d. Conducting the analysis and coordinating corrective actions? 

DATA-BASE ACCESS CONTROL 

Access control must protect against all unauthorized access , whether 
deliberate or Inadvertent. 

Access control In the typical tape-oriented simple system has not 
been a major problem* Physical control of magnetic tapeCs) and certifi- 
cation of programs authorized to process specified files have normally 
sufficed. 

However 9 significant new problems of access control are Introduced 
by the ALS-type system* The on-line nature of the system makes the 
data base potentially accessable by each system user^ subject only to 
the effectiveness of protective features built Into the hardware 
and software^ and operational constraints [11] • 

An ALS-type data base may be more attractive to the deliberate 
penetrator for two reasons. First » functional Integration of data often 
creates new Information In the sense that the data were not previously 
available for coordinated processing* Second » the centralization of 
data may decrease the exposure the penetrator must risk. A determined 
penetrator can usually breech a system thought to be secure [12] • 

The development and operation of an access-control plan requires 
the coordination of system designers ^ developers ^ operators ^ and users. 

CHECKPOINTING > RESTART, AND DATA-BASE RECOVERY 

The three closely-related activities of checkpointing « restart ^ 
and data-base recovery are among the most critical from the standpoint 
of data integrity [13,14]. On-line operation utilizing direct storage * 
devices creates many new sources of potential trouble. For instance , 
operating systems are typically much more complex than those in simple 
systems, increasing the probability of software-related failure. 

ERIC 
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Checkpointing Is the "snapshot" recording of such pertinent Infor- 
mation about a computer system as the contents of computer memory, 
message queues, the state of the operating system, and the status of 
data-base processing. Checkpointing may also refer to the recording 
of the entire contents, or a selected portion, of a data base. Check- 
pointing helps to protect against unexpected problems In the ftanctlon- 
Ing of a computing system. 

Checkpointing techniques are governed by, and Influence, such con- 
siderations as software efficiency, the allocation of data bases to 
storage devices, and data-base size and activity. 

Restart Is the process of reestablishing the status of the com- 
puting system as It existed at the time of an Interruption, planned or 
otherwise, and resuming system operation. Checkpointing supplies the 
Information needed In the restart procedure. Restart, which often has 
been a troublesome activity In aimple systems. Is a major source of 
concern In the ALS-type system. Error analysis and resolution are 
usually difficult, complex procedures must be developed and followed, 
and actions coordinated among several parties. 

Data-base recovery Is the restoration of a data base after It has 
been damaged, either physically or logically. This task la extremely 
demanding In an ALS-type system. Involving both system users and oper- 
ators. The data-base processing characteristics of continuous update 
and update-ln-place require elaborate procedures to assure recovery 
capability [15]. In fact, data-base recovery Is a major deslgn'con- 
slderatlon In the development of an ALS-type system. 

MAINTENANCE OF CONTROL TABLES 

ALS-type systems make extensive use of control tables of various 
types. Some of these tables are used to control system functions and 
operations. Examples are tables containing the characteristics of 
communication lines and input /output devices » 

Other tables of control information are application-oriented. 
Typical tables of this type contain information on transaction identif- 
ication and validation, data-base characteristics, and update authoriza- 
tions for data-base maintenance. 



ERIC 



•10- 



Because the Impact of changes In these tables Is so far reaching, 
centralized documentation and control of table maintenance Is essential. 
Responsibility for table content should remain with the appropriate 
group, but all changes must be coordinated. 

DATA-BASE ACTIVITY MONITORING 

Activity monitoring Involves the on-going recording and analysis 
of data-base maintenance and usag^. Typical Items of Interest Include 
the number of transactions processed per unit of time and the activity 
ratio (the ratio of records accessed to all records) • 

There are several reasons for the establishment and coordination 
of this data-base activity monitoring in an ALS-type system: (1) it 
assists in the continuing validation of application design, and often 
provides a basis for program modification and redesign; (2) it provides 
input into hardware review and reconfiguration; (3) tc is essential in 
the allocation of data bases to the most appropriate storage medium; 
(4) it supports periodic review of the adequacy of data-base structures 
and provides a basis for restructuring. 

APPLICATION PROGRAM TESTING, CERTIFICATION, AND CONTROL 

Application programs must be tested for (1) concurrence with de- 
sign specifications, (2) successful integration with other programs, 

(3) conformance to constraints imposed by the operating system, and 

(4) such other considerations as programming standards and Impact on 
overall system performance. This testing is required in both initial 
program development and subsequent modifications. Successful testing 
should be followed by a formal process of certification, after which 
the program becomes part of a certified program library. The estab- 
lishment and control of such a library substantially increases the 
quality of system performance. 

Program testing, certification, and control are required in all 
computing systems. But the nature of the ALS-type system greatly in- 
creases their importance. Typically, the responsibilities for these 
activities are spread throughout the organization. However, they are 
interrelated and mu^t be coordinated. 
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The above comments are at least equally applicable to user- 
developed system software. Software programs become extremely complex 
and highly interdependent in an ALS-type system. This complexity, 
combined with more stringent response-time requirements and the po- 
tential for rapid propagation of exrors, makes centralized testing, 
certification, and control essential Il5]. Vendor-supplied software 
must also go through a process of certification and control. 

SYSTEM UTILITIES 

The design and development of program tools to support many of 
the activities described above is a major requirement in an ALS-type 
system. Although various utilities~e. g. , sort, merge, and dump pro- 
grams—are normally available from the vendor, it is often necessary 
to modify them to meet special needs. In addition, it is usually 
necessary to develop unique utilities for the particular system. Ex- 
amples of the latter are special-purpose data-base display and modifi- 
cation programs. Coordinated design and development of these programs 
is highly f'--?irable. 

TRAINING. STANDARDS, AND DOCUMENTATION 

Each of the activities discussed above requires (in varying de- 
grees) consideration of training, standards, and documentation. They 
are mentioned separately to emphasize that an ALS-type system is par- 
ticularly sensitive to careful training, the creation of positive and 
practical standards, and the establishment and maintenance of appro- 
priate and usable documentation. 

***** 

The above discussion of data-related activities illustrates two 
points: 

1. Data integrity in an ALS-type system is complex in terms of 
the problems encountered, the procedures and resources re- 
quired, and the skills needed. 
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2. An ALS-^type system Imposes broad accountability for data 
Integrity In both development and operation of the system. 

These two factors — increased complexity and broad accountability — 
have led to the development of a new organizational element, as ALS- 
type systems have been Implemented, that provides a focal point for the 
Issues and problems described above [7,16]. This Individual or group 
may go by any of several names^-^Data Base Manager and Data Base Admin- 
istrator are the two most frequently used. For consistency, Data Base 
Manager (DBM) Is used In this report In reference both to the person (s) 
Involved and to the office. 

Section III examines some current approaches to the Data Base 
Manager as the cornerstone of a data Integrity program. 
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III. CURRENT APPROACHES TO THE DATA BASE MANAGER 

ANALYSIS OF CURRENT APPROACHES 

Although general agreement exists that a Data Base Manager (DBM) , 
or the equivalent, Is needed In ALS-type systems, such specific Issues 
as a charter of appropriate activities, when the DBM should be estab- 
lished. Its placement In the organization, and the size and skill re- 
quirements for Its staff are being approached In a variety of ways. 

In gathering Information on the Data Base Manager concept, a nura-t 
ber of companies with Information systems containing existing or evolv- 
ing data Integrity programs were visited.* These companies were 
selected for their similarity to the planned ALS. The selection 
criteria were: 

1. Common data banks of at least several hundred million 
characters; 

2. Terminal orientation; 

3. On-line update; 

4. High transaction rate; 

5. Operational for at least a year. 

ROLE OF THE DATA BASE MANAGER 

No consensus has evolved on the subject of a charter of activities, 
or role, for a Data Base Manager [17] . Substantial variation is found 
in the activities of existing DBMs, resulting from such factors as 
personnel capabilities; preexisting organizational responsibilities, 
and the relative maturity of a system. 

How then can the role of a DBM for a particular system be defined? 
One approach is to answer two questions. First, for which activities 
will the DBM be responsible? As noted above, there is no standard 

5k 

North American Rockwell (Downey, Calif.), Hughes Aircraft Company 
(Fullerton, Calif.), Western Airlines (Los Angeles), Continental Air- 
lines (Los Angeles), American Airlines, (Tulsa, Okla.), and Southern 
Pacific Company (San Francisco) . 
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answer to this question. The list of data- related activities in Sec. 
II provides a good starting point. Though not exhaustive, this list 
covers activities that usually concern a DBM. The second question deals 
with the natitre of the DBM*s responsibility for each of the activities. 
Each activity embraces a spectrum of potential responsibility, with 
decisionmaking power at one extreme and administrative/clerical func- 
tions at the other. The role of the DBM, therefore, can be outlined 

» 

in terms of (1) the activities he is responsible for, and. (2) the nature 
of his responsibility for each of the activities. 

The DBM (or equivalent) in each company visited was analyzed in 
terms of whether or not it performs the activities listed in Sec. II; 
and if so, the nature of its responsibilities. There is general agree- 
ment with respect to some of the activities (Table 2). However, all of 
the activities listed were performed by one or more of the DBM's. As 
would be expected, the highest levels of responsibility are associated 
with those activities directly related to the data base. 



Table 2 

DATA-BASE MANAGER RESPONSIBILITIES 



Activity 


Nature of DBM Responsibility 


No 

Consensus 


Decide 


Coordinate 


Administer 


Data-element definition 




X 






Data-base structure 


X 








Data-base creation and 










certification 


X 








Data-base update 








X 


Data-base activity monitoring 








X 


Data-base quality analysis 






X 




Data-base access control 




X 






Checkpoint, restart, and 










data-base recovery 


X 








Maintenance of control tables 


X 








Transactions editing 








X 


Application program testing. 










certification, and control 








X 


System utilities 


X 








Training, standards, and 










documentation 




X 
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FITTING THE DATA BASE MANAGER INTO THE ORGANIZATION 

Different approaches have been followed by managers of ALS-type 

systems in positioning the Data Base Manager organizationally. Figures 

1, 2, and 3 are edited organization charts for three such systems. 

A number of factors have combined to account for these different 

approaches to organization: 

1. The considerable variance in the total set of responsibilities 
from one system to another affects where the DBM is located 
organizationally. For example, if the thrust of responsibil- 
ity is toward user service, the DBM tends to be associated 
with the group that has the most user-orientation — often 
applications programming. 

2. DBM assignments tend to migrate toward groups containing in- 
dividuals that have both technical strength and strong inter- 
ests in such activities. 

3. The preexisting structure of the organization, and the man- 
agement's willingness to change, also affect the DBM's 
locat ion. 

4. The DBM is an evolving group whose location within the organ- 
ization may change as the system matures. 

Despite the differing approaches to organizational placing of the 
DBM, it is important to remember that in each of the companies the 
need for a DBM has been empirically recognized and, within the con- 
straints and considerations described above, a new organizational 
element has been created. 

PLANNING FOR THE DATA BASE MANAGER 

None of the companies visited had had a DBM or its equivalent when 
their systems began operations, yet each soon found such a function 
necessary. Developers of ALS-type systems can profit from this conrion 
experience by planning for a DBM diring the system development phase. 
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Typically, the organization that develops an ALS-type system finds 
that several of its groups either disappear or take on somewhat differ- 
ent roles after the system is implemented. These groups include system 
testing and integration, data conversion, and initial data-base design 
and structuring. Their work is the logical antecedent of many of the 
activities that may be assigned to a DBM. Most of the DBM activities 
listed in Sec. II begin, in one form or another, during the early stages 
of system development. 

Thus, it is logical to plan for the establishment of a DBM, as the 
foundation of a data integrity program, early in the development process 
This planning should cover people, procedures, and special programming. 

• People and Proaedures^-The gradual transition of the 
people in the groups mentioned above into a DBM is a 
natural process. For example, after the integrated 
data base is successfully loaded, a data-conversion 
group as such is no longer needed. However, there is 
a need for people with data-conversion type experience 
to effectively deal with such requirements as data-base 
checkpoint and recovery. Data-conversion people are 
normally concerned with these subjects during the con- 
version planning and implementation. By the same 
token, rtAny of the procedures developed for use in 
system implementation can, with proper planning, have 

a high degree of usefulness after the system becomes 
operational . 

• Special Programning-^mth a reasonable amount of extra 
planning, much of the special programming written to 
support system implementation can be used in an on- 
going data integrity program. An example is the de- 
velopment of standard test cases. With a modest extra 
effort, the test cases employed during system develop- 
ment can be standardized for continuing use after the 
system becomes operational. Another example is the 
data-conversion programs written to load the data base 
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and to deal with exceptions occurring during the loading. 
These programs can be designed for direct use» or easy 
modification and use» by the DBM In data-base recovery. 

In summary, the foundation for the DBM can and should be laid dur- 
ing development and Implementation sc that Its formal establishment Is 
essentially an acknowledgment of an accomplished fact. 

In planning for the DBM, two other obpervatlons are useful: 

1* Inasmuch as the DBM has heavy Interface responsibilities 
among different elements of the computing organization and 
the user community, It Is extremely Important that clean 
lines of communication and authority, both formal and In- 
formal, be established, 

2. As with any group whose responsibilities cut across those 
of other groups. It Is essential that the DBM have firm and 
consistent management support. The DBM will absorb respon- 
sibilities that have been traditionally held by other groups 
within the organization, and clear management support Is 
essential for DBM success. 

STAFFING CONSIDERATIONS 

Organizations with Data Base Managers generally agree on the char- 
acteristics of a successful DBM staff: 

1. Experience: Typically, members have from one to six years 
experience In data processing — the average being two to three 
years. These data-processing personnel normally come from 
either applications programming or computer operations groups. 
However, people from user organizations have also been suc- 
cessfully Integrated Into DBM staffs. 

2. Communication skills: The heavy responsibilities of Inter- 
facing at many levels and among diverse groups places a prem- 
ium on communication skills for the DBM staff. It Is essential 
that they be able to convey accurately their Ideas both orally 
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and m writing. At least as Important Is sensitivity to the 
import of the words and actions of others. These communica- 
tion skills also imply tact fulness in dealing with high-stress 
situations. 

3. The DBM staff must also have had at least an exposure to user 
operations. One of the keys to success is the DBM's ability 
to understand user problems and needs in the user's vocabulary. 

A. The DBM staff should be relatively stable. High turn-over is 
potentially a major problem, as much of the work done by the 
DBM may be viewed by the data-processing staff as system- 
maintenance work and, as such, not particularly attractive or 
stimulating. Data Base Managers have therefore found it neces- 
sary to develop ways of continuously motivating their people. 
One way has been to assign more respondibllity than is normal 
at the DBM itaff salary level. This tactic requires extreme 
care in selecting people, with the result that DBM staffs tend 
to be very high quality. Another way in which DBM people have 
been motivated is by the granting of special privileges. An 
example is providing them with a special terminal having 
privileged access to the data base and other capabilities not 
generally available to the programming staff. Yet another 
means of insuring the stability of the DBM staff is to en- 
courage the development of automated tools and procedures that 
remove much of the drudgery from the work. Though not neces- 
sarily cost-effective, such procedures make the work more 
attractive. 

5. DBM personnel should be of the type who thrive on problems. 

People vary tremendously in their abilities to work effectively 
in the face of uncertainty. Such ability should be one of the 
selection criteria In assembling a DBM staff. 

Most DBM staffs nuibcr from five to ten people, including the 
supervisor. The number appears to be somewhat independent of the size 
snd complexity of the system, the number of programmers involved, and 
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(to a large degree) the number of applications supported. Rather, 
staff size appears to be a function of range of responsibility* 

Two totally different approaches have been followed in organizing 
the work of the DBM staff* One involves the development of a team of 
specialists, each one responsible for an activity such as data-base 
design, data-base language, data communications, system definitions 
(table maintenance, user coordination), procedures, training and forms, 
and ^«^tandards development and enforcement. The other approach has 
been to develop a staff consisting of broadly trained members, each of 
whom, while tending to work in one or two areas, is capable of working 
at an acceptable level in all the areas of responsibility. Both ap- 
proaches appear successful. The key seems to be the supervisor's 
ability to motivate the staff rather than the manner in which respon- 
sibilities are assigned. 

THE FUTURE OF THE DATA BASE MANAGER 

As more experience is gained with ALS-type systems, it is reason- 
able to expect increasing agreement on the role of the DBM. The nature 
of such systems — the complex data integrity requirement and the asso- 
ciated broad accountability — ^will lead to increased responsibilities 
for DBMs. This growth in responsibility will in turn affect the 
organization, size, and type of staff. 

In the long run, however, it is not clear that the DBM will re- 
tain its identity as its role continues to develop and grow. The DBM 
may be but an evolutionary feature in the way complex information sys- 
tems are managed. It is possible, for instance, to conceive of the 
DBM being incorporated into a System Manager or Administrator who would 
be separate from the traditional programming, analysis, and operations 
group, and who would be responsible for the coordination and varying 
degrees of control of all operational system activities in much the 
same way as the DBM is responsible for data-related activities. 



In cne case, the number of applications supported doubled over 
a period of a year-and-a-half with no corresponding increase of the 
EDM staff. 
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IV. EDUCATION/ INDOCTRINATION FOR DATA INTEGRITY 



The development of a pervasive awareness and acceptance of the 
broad data-related accountability discussed above Is necessary in a 
data integrity program for an ALS-type system. Many people in comput- 
ing system organizations are not accustomed to "thinking data integrity,' 
and a data integrity program should address this problem. The key to 
success is the involvement of all personnel through their understanding 
of the impact of their work on the data base. 

This effort should begin during system development and implementa- 
tion. Specific actions that might be taken include: 

1. A series of indoctrination seminars covering such subjects 
as data-base structuring, description of system-control 
tables and their uses, programming standards, and data con- 
version. These seminars would be coordinated by the develop- 
ment manager and conducted by the appropriate groups. The 
seminars would emphasize the interrelationship among the 
many activities, and might be given at several organizational 
levels. 

2. A newsletter that would reinforce and document the seminars 
and provide a forum for constructive comments and suggestions. 

The education/indoctrination effort should continue after system 
implementation. New people will be joining the organization, and there 
will be new developments of interest to the entire staff. At that 
point, responsibility for the effort may logically be assigned to the 
DBM even though the dissemination of information not directly related 
to data integrity may be involved. Specific actions could include: 

1. A tailored indoctrination program for new people that would 
include an overview of the system with emphasis on their 
particular roles and a reading program to bring them up to 
speed in appropriate areas. 
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2. Continuation of the seminar series » with the emphasis shift- 
ing to new developments. 

The success of this effort requires the commitment of management 
and the resulting willingness to allocate resources to carry It out. 
The mechanisms for Implementation suggested above are not obviously 
"productive" In the normal sense , and there would be great temptation 
to bypass or devitalize them under pressures of time and/or budget 
limitations. 



V. CONaUSIONS 



Information systems organized around large, integrated data bases 
require a high degree of coordination and control of data-related ac- 
tivities. Developers and operators of such systems have found that an 
effective way of accomplishing this has been the creation of a Data 
Base Manager /Administrator (DBM) . 

At this time there appears no optimum location for the DBM within 
an organization. Several successful approaches have been followed by 
companies that have implemented DBMs. The most important factor in 
placing the DBM is the over-all thrust of its responsibilities. 

In the case of an information system under development, it is 
possible, with modest additional efforts in many of the activities, to 
develop the people, procedures, and program tools that will eventually 
constitute the DBM. This is particularly true with respect to activ- 
ities such as data conversion and data-base creation. 

The DBM function requires people with the somewhat unusual com- 
. bination of strengths in both technical areas and interpersonal rela- 
tionships. Inasmuch as most DBM staffs have substantial interaction 
with system users, familiarity with user operations is another key 
requirement. 

A continuing program of education and indoctrination for data in- 
tegrity is an essential ingredient in the operation of a complex system. 
Such a program may be administered by a DBM as part of its overall con- 
cern with the issue of data integrity. 

Data Base Managers are playing essential roles in the operation 
of ALS-type information systems. As noted in this report, many ques- 
tions and alternatives must be considered in the establishment of a 
DBM. The exact nature of the specific responsibilities and the authority 
assigned to a DBM may vary from system to system, but central awareness, 
coordination, and control of data- related activities are vital. 
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APPENDIX 

BRIEF DESCRIPTION OF THE ADVANCED LOGISTICS SYSTEM 

The Advanced Logistics System (ALS) represents a substantial re- 
design of the AFLC Logistics management. When fully Implemented, the 
ALS will be built on one of the largest computing systems to date. 
There will be six large- and one medium-scale Installations, each lo- 
cated at a major logistics center. These Installations will eventually 
replace most of the computing equipment presently used by tne Command, 

A key feature will be the creation and maintenance of massive 
data bases at each processing slte% Information from many logistics 
functions will be Integrated Into a relatively few data bases organized 
around logistical processes, thus making possible the development of 
management Information previously unobtainable and the elimination of 
much of the data redundancy Inherent In present systems. 

The data bases will be collectively referred to as the Unified 
Data Bank (UDB), one at each ALS site* The DDE design will be stan- 
dardized, although the contents for each site will be tinlque. 

Much of the ALS processing will be done In real time or near-real 
time. In contrast to the sequentlal-f lie, batch-Input mode character- 
istic of present AFLC data systems* Transactions and queries will be 
Input from remote terminals throughout che Command by AFLC personnel, 
with the system supporting Interactive operation from any of the 
terminals. 
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